Towards understanding heterogeneous clouds at scale: Google trace analysis

نویسندگان

  • Charles Reiss
  • Alexey Tumanov
  • Gregory R. Ganger
  • Randy H. Katz
  • Michael A. Kozuch
چکیده

With the emergence of large, heterogeneous, shared computing clusters, their efficient use by mixed distributed workloads and tenants remains an important challenge. Unfortunately, little data has been available about such workloads and clusters. This paper analyzes a recent Google release of scheduler request and utilization data across a large (12500+) general-purpose compute cluster over 29 days. We characterize cluster resource requests, their distribution, and the actual resource utilization. Unlike previous scheduler traces we are aware of, this one includes diverse workloads – from large web services to large CPU-intensive batch programs – and permits comparison of actual resource utilization with the user-supplied resource estimates available to the cluster resource scheduler. We observe some under-utilization despite over-commitment of resources, difficulty of scheduling high-priority tasks that specify constraints, and lack of dynamic adjustments to user allocation requests despite the apparent availability of this feature in the scheduler.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GloudSim: Google trace based cloud simulator with virtual machines

In 2011, Google released a one-month production trace with hundreds of thousands of jobs running across over 12,000 heterogeneous hosts. In order to perform in-depth research based on the trace, it is necessary to construct a close-to-practice simulation system. In this paper, we devise a distributed cloud simulator (or toolkit) based on virtual machines, with three important features. (1) The ...

متن کامل

3D Animation of Asian dust clouds in the Google Earth

The widely spread Asian dust over East Asia has significant effects on the climate change and fine dust particles in the atmosphere have harmful influence on our health. It is, therefore, important to understand when and where sand dust particles were released and through which paths they were transported to other areas. However, it is difficult to show the movement of 3D objects on the global ...

متن کامل

Parameterisation of orographic cloud dynamics in a GCM

A new parameterisation is described that predicts the temperature perturbations due to sub-grid scale orographic gravity waves in the atmosphere of the 19 level HadAM3 version of the United Kingdom Met Office Unified Model. The explicit calculation of the wave phase allows the sign of the temperature perturbation to be predicted. The scheme is used to create orographic clouds, including cirrus,...

متن کامل

Testing of Cloud Applications in the Cross-Cloud Environment

Cloud computing is the new paradigm to deliver all the hosted services over internet on demand. The ultimate goal of cloud computing paradigm is to realize computing as a utility. The cloud is rapidly maturing towards its goal to support a wide variety of enterprise and consumer services and real-world applications. Recently a movement towards cross cloud also called as multi-clouds or inters c...

متن کامل

Cloud Sustainability Dashboard, Dynamically Assessing Sustainability of Data Centers and Clouds

External Posting Date: September 6, 2011 [Fulltext]. Approved for External Publication  Cloud Sustainability Dashboard, Dynamically Assessing Sustainability of Data Centers and Clouds Cullen Bash, Tahir Cader, Yuan Chen, Daniel Gmach, Richard Kaufman, Dejan Milojicic, Amip Shah, Puneet Sharma HP Laboratories HPL-2011-148 Sustainability; Clouds; monitoring; and modeling. Quantifying and underst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012